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ABSTRACT 

We consider the shape of the posterior distribution to be used when fitting cosmological mod¬ 
els to power spectra measured from galaxy surveys. At very large scales, Gaussian posterior 
distributions in the power do not approximate the posterior distribution Vfi we expect for 
a Gaussian density field Sk, even if we vary the covariance matrix according to the model 
to be tested. We compare alternative posterior distributions with Vr, both mode-by-mode 
and in terms of expected measurements of primordial non-Gaussianity parameterised by /nl- 
Marginalising over a Gaussian posterior distribution Vf with fixed covariance matrix yields 
a posterior mean value of /nl which, for a data set with the characteristics of Euclid, will 
be underestimated by A/nl = 0.4, while for the data release 9 (DR9) of the Sloan Digital 
Sky Survey (SDSS)-III Baryon Oscillation Spectroscopic Survey (BOSS) it will be underesti¬ 
mated by A/nl = 19.1. Adopting a different form of the posterior function means that we do 
not necessarily require a different covariance matrix for each model to be tested: this depen¬ 
dence is absorbed into the functional form of the posterior. Thus, the computational burden of 
analysis is significantly reduced. 

Key words: methods: statistical - cosmology: large-scale structure of Universe - (cosmol¬ 
ogy:) inflation. 


1 INTRODUCTION 

Forthcoming galaxy surveys, such as the Dark Energy Spectro¬ 
scopic Instrument (DESI; Schlegel et al. 2011), Euclid (Laureijs 
et al. 2011)^ and the Square Kilometre Array (SKA) will con¬ 
strain the Universe’s expansion history, geometry and the growth 
of structure with unprecedented accuracy. The basic statistics con¬ 
taining large-scale structure information are the 2-point cluster¬ 
ing measurements, the correlation function and the galaxy power- 
spectrum P{k). As they form a Eourier pair, their information 
content is the same and we focus only on the latter in this work. 
The linear galaxy power spectrum encodes a wealth of information 
about the physics of the Universe, allowing us to constrain cosmo¬ 
logical models with baryon acoustic oscillations (BAO), gravita¬ 
tional models with redshift space distortions (RSD) and inflation¬ 
ary models with primordial non-Gaussianity, parametrised to first 
order by /nl. In order to do so, one has to know the likelihood 
and/or posterior of power-spectra, which for simple cases can be 
calculated analytically. For general cases, one usually assumes the 
likelihood or posterior to be multi-variate Gaussian with a covari- 
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ance matrix Cij = {P(ki)P(kj)). The estimation of the covari¬ 
ance matrix is a critical step in the analysis of data. Internal meth¬ 
ods such as the sub-sample, jackknife and bootstrap methods have 
been widely used in the past, but Norberg et al. (2009) have shown 
that they are not able to faithfully reproduce variances. Robust es¬ 
timates are often instead obtained from mock galaxy catalogues. 
In recent analyses of the Baryon Oscillation Spectroscopic Survey 
(BOSS; Dawson et al. 2013), these were generated from second or¬ 
der Lagrangian Perturbation Theory matter fields using a friends- 
of-friends group finder (Davis et al. 1985) to find haloes (Scocci- 
marro & Sheth 2002; Manera et al. 2012). Their masses were cal¬ 
ibrated by comparisons with N-body simulations. A Halo Occupa¬ 
tion Distribution then prescribed how to populate these haloes with 
mock galaxies, and the geometry and the efficiency of the survey 
were sampled. Alternative methods for producing mocks include 
N-body simulations, comoving Lagrangian acceleration (COLA; 
Tassev, Zaldarriaga & Eisenstein 2013) simulations or alternative 
simpler methods such as pinpointing orbit-crossing collapsed hier¬ 
archical objects (PINOCCHIO; Monaco, Theuns & Taffoni 2002) 
or effective Zel’dovich approximation mocks (EZmocks; Chuang et 
al. 2015). The covariance matrix is then the sample variance of the 
power spectra from the different mocks (Manera et al. 2012; Taylor 
et al. 2013; Percival et al. 2014). The covariance matrix computed 
from the mocks will depend on the cosmological model that was 
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Table 1. Notation used for probabilities. 


symbol 

name 

description 

ZiP) 

evidence 

probability of the data P 

L(P\Ph) 

likelihood 

probability of the data P given the hypothesis Ph 

V{Ph\P) 

posterior 

probability of the hypothesis Ph given the data P 

n(Pir) 

prior 

probability of the hypothesis 


used to generate them. It is computationally costly to produce mock 
catalogues for each possible cosmological model and set of param¬ 
eters to be tested, so one usually chooses a cosmological model 
which will produce a P(k) reasonably close to the measured one 
and uses the covariance matrix computed from the mocks created 
assuming that model. This approximation does not hold in general, 
especially at large scales. In this article, we study other ways of ap¬ 
proaching this problem, including using approximations to the true 
posterior distribution to obtain accurate inferences without requir¬ 
ing a covariance matrix for each cosmological model. We apply 
the most suitable of these approximation and the true distribution 
to provide a probability distribution function (PDF) for measure¬ 
ments of the non-Gaussianity parameter /nl- Our result will pro¬ 
vide a complementary method to analysing /nl directly from 5{x), 
as described in Verde et al. (2013). 

We proceed as follows. In Sec. 2, we test the standard Gaus¬ 
sian posterior shapes mode-by-mode on a toy example where we 
measure the power spectrum itself. We do a similar test in Sec. 2.5 
to study the impact of using different posterior distributions on a 
real survey, i.e. the Sloan Digital Sky Survey (SDSS)-III Baryon 
Oscillation Spectroscopic Survey (BOSS). In Sec. 3, we study al¬ 
ternative posterior shapes for power spectrum estimates inspired 
by CMB analyses. We test the most promising posterior distribu¬ 
tion by postdicting a /nl -measurement for a data sample like the 
data release 9 (DR9) of BOSS and we make predictions of Euclid 
/nl -measurements in Sec. 4. We conclude in Sec. 5. 

Throughout this work, we adopt a Bayesian framework and 
mark observed data with a hat, e.g. P and quantities related to the 
hypothetical model with an H, e.g. Ph- We denote probability dis¬ 
tributions with different letters L, V, 11 and Z, which we define in 
Tab. 1, to make clear whether they depend on data and/or the model. 
The ubiquitous Bayesian equation thus reads for this example 


p[Ph\p) 


L (p|Prf) n(Prr) 

W) 


( 1 ) 


2 THE POWER SPECTRUM LIKELIHOOD 

In this section, we elaborate the analytic likelihood and poste¬ 
rior functions of the galaxy clustering power spectrum assuming 
a Gaussian density field. We consider this posterior function as the 
’’truth” and compare it to commonly used approximations of the 
galaxy power spectrum posterior function for single modes, which 
we shall introduce in Sec. 2.3. 


2.1 The True Distribution of 1| Under the Assumption of a 
Gaussian Density Field 


The positions of the galaxies in a survey can be transformed into a 
galaxy over-density field 


n{x) 


( 2 ) 


where n(x) is the measured galaxy number density and n(x) the 
expected value. Fourier transforming S{x) yields 

1 


Sk = 


V 


d'^xS(x) exp(ikx) 


whose covariance matrix 


{Sk^Sl^) = 


(271 


V 


-Soiki - k2)P(ki) 


(3) 


(4) 


is given by the power spectrum P{k). Following the standard as¬ 
sumption that 5h forms a Gaussian random field, the probability 
of measuring a particular value of the real and imaginary parts 
(^Su,Sv'j of a single 5k = 5u + iSv is a zero centred Gaussian 
distribution with standard deviation half the true power ^Prik): 


We use the letter Z here, because we have assumed that the true 
power is known, i.e. the distribution only depends on the data (c.f. 

- /- 2 2 

Tab. 1). The distribution of the absolute value |(5fc| = y (5u -I- 
is given by a Rayleigh distribution: 


Zr = y y dS^Z .Z ( 5 ;) 5 d 


2\S^\ 

Prik) 


exp 





+ Sv 


( 6 ) 


Throughout this article, we regard Eq. (6) as the ’’true” distribution 
of |(5fc| to which we compare several approximations later. 

Any model dependence enters the Rayleigh distribution only 
in the covariance of the density field, which is equal to the true 
power spectrum. The position of the distribution’s peak equals the 
value of the true power. Measurements of 5k have been used to 
make cosmological inferences when they have been further decom¬ 
posed into spherical harmonics and spherical Bessel functions, be¬ 
cause radial and angular modes can be distinguished, allowing an 
easy analysis of redshift-space distortions. However, this method is 
rather complex and computationally expensive (Heavens & Taylor 
1995; Percival et al. 2004). It is difficult to linearly compress 5k 
efficiently maximally retaining information. 


2.2 The Posterior in Terms of the Power 


We can rewrite the Rayleigh distribution in terms of the power. We 

replace Prik) with Puik), and 5k with \JP{k) in equation (6) 
which in this way depends on both data and model, and hence be¬ 
comes a likelihood (c.f. Tab. 1): 


Lr (p(fe)|Prr(fc)) 


‘2\jP{k) ( p(fc) \ 


(7) 
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We can use Bayes’ theorem (cf. Eq. (1)) to find the posterior. It is 
standard to assume a uniform prior 


n(Prr(fc)) 


1 

^max(fc) 

0 


, if 0 ^ Pir(fc) ^ Pmax(fe), 

otherwise, 


( 8 ) 


modes. Note that we assume that the widths and positions of the 
fe-bins are such that window effects are negligible (Feldman et al. 
1994) and different modes are independent. 

As in Sec. 2.2, we replace Prik) with Pnik) in equation (15) 
making it a likelihood (c.f. Tab. 1): 


which requires an arbitrary choice of Pmax(fc). We assume that 
Pmax(fc) is far in the right tail of the likelihood such that 

is effectively constant and hence acts only as a normalisation factor. 
Thus, for the ’’true” posterior we have 


P«(Prf(fe)|P(fc)) 


Lr (p(fe)|Prr(fe)) 
JAPhLr (p(fe)|Prr(fe)) 


\/P(fe) ( P(k) \ 

P^®^Pl^-p^j- 


(9) 


As P is a constant in the posterior, one can rewrite Eq. (9) such that 
the log-posterior only depends on the ratio P(fe)/Pif (fc): 

-2 InfPfl) = 2M In I \ + 2M + const. (10) 

V 

Then we follow the method of Hamimeche & Lewis (2008) and 
introduce 


L (p(fe)|Prr(fc)) 


exp 


1 [P(fe)-PH(fe)]^\ 
^ gfc ) 


(16) 


where = {PH{k)) is the variance for the hypothetical power 
spectrum Pnik). 

However, in practice one chooses a fiducial model with power 
spectrum P(fe) and estimates the variance Ck = ^P^(fc)^ for this 
particular choice: 

L (p{k)\p„{k),Ck) = -(17) 

^ ' V27vCk 


For mock based variance calculations, P(fe) is the cosmology of 
the mocks used in their analysis. 

We can again use Bayes’ theorem (cf. Eq. 1) and assume the 
same uniform prior as before to find the posterior. For the posterior 
assuming a Gaussian distribution in P(fc) with model-dependent 
covariance we have 


7(*) = \/—ln(a;) +x 


to make Eq. (10) look more quadratic: 


-21n(Pfl) = 2M 


( P(fc) 

\PH{k) 


2 

+ const. 


We can also define 


( 11 ) 


( 12 ) 


P d (Ph 


r{k)\P{k)) 


L (P(fc)| Pff(fe)) 
JdPH L (P(fe) Pp(fe)) 


exp 1 2 ^73 




(18) 


P,W = F,«=h(^) ,13) 

for some fiducial model with power Pf. Pj has then a symmetric 

~ 2P?(fc) 

Gaussian posterior with a fixed variance Ck = —evaluated 
for our fiducial model: 

—21n(P_R) = TP.yC’jT^P-y -I- const. (14) 

In general, things are more complicated than this simple pic¬ 
ture. For example, the survey geometry leads to a convolution of Sk, 
and non-linear effects distort the small scale mode distribution. Ide¬ 
ally, we would like to use a single distribution, and this should be 
matched to simulations (e.g. Blot et al. 2015). In order to broaden 
the choice, we also consider a number of forms for the likelihood 
inspired by CMB analyses. 


2.3 Common Approximations of the Likelihood/Posterior of 
the Power Spectrum 


Often, the power-spectrum is directly analysed, incorrectly assum¬ 
ing it follows a Gaussian distribution, thus the distribution of a finite 
empirical realisation of the power spectrum P{k) would read 


.Z 



exp 


f 1 [P(fc)-PT(fc)]''^ 
2 Cfc J 

v/23^ 


(15) 


where Ck = {PT{k)) = is the variance of the true power 

spectrum Pt at a bin centred around k comprising M independent 


where we adopt the subscript notation Vd of Hamimeche & Lewis 
(2008). Note that both the exponential and the covariance matrix 
cf depend on Pnik). 

If a fixed covariance is assumed, we have to apply the 
Bayesian Eq. (1) to Eq. (17) giving 

/ , [P(k)-PR(k)f \ 

/ 1^ Ck I 

Pf {PH{k)\P{k),Ck) <X - ^(19) 

^ ^ VffCk 

2.4 A Simple Test of the Posterior Shapes for the 
isotropically averaged power spectrum 

In this subsection we combine the single mode posterior func¬ 
tions to posterior functions of the band-power. We do not take any 
anisotropic effects, such as redshift space distortions, into account. 
This is conservative because the effective volume for higher mul¬ 
tipole moments (cf. Eq. 25) is smaller, therefore containing fewer 
independent modes and thence amplifying the effect of choosing 
different posterior shapes. 

In Gaussian cases, we suppose that our volume is large enough 
to accommodate M independent complex Gaussian distributed 
samples of 5k such that we can use 

Cab = J^Soika — kfP^Cka). (20) 

to calculate the covariance matrices at higher numbers of modes 
M. We can obtain the band-power version of Pr (^PH{k) | |(5fc| j 
by multiplying together the single mode expressions. 
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The three different posterior shapes of Ph are plotted in Fig. 1. 
In the top panel of Fig. 1, we plot single mode posterior distri¬ 
butions for which we adopt |(5fc| = 100 and Prik) = P{k) = 
— 2 

|(5fc| = 10000. Note that a different choice would shift the peak 

positions and normalisation factor, but preserve the shapes. We 
make two different choices for the fixed covariance to see the 
effect of making the wrong assumption. For the dotted red line, 
we choose the covariance matrix which corresponds to the true 
power spectrum i.e. Ck = 2P^{k) = 50000000, and for 

the dashed-dotted line, we consider that our guess of the power 
spectrum is 5 per cent lower than the actual power spectrum, i.e. 
Ck = 45125000. The panels in the middle and at the bottom of 
Fig. 1 show the posterior distributions for 10 and 100 independent 
modes respectively. 

Fig. 1 and 2 show that different choices of the covariance ma¬ 
trix provide very different posterior distributions for a small num¬ 
ber of modes, but if we can increase the number of independent 
modes, we see the effect of the central limit theorem and the poste¬ 
rior distribution functions become more and more similar. We ob¬ 
serve that the maximum of the fixed-covariance posterior always 
agrees with the true value, even if the wrong fiducial model has 
been chosen. However, if we choose the wrong covariance matrix, 
we over- or underestimate the error of our measurements. If we do 
not fix the covariance, the best fit, i.e. the maximum of the poste¬ 
rior, has an offset with regard to the true value, which decreases 
as the number of modes increases. We also notice the long right 
tails of the varying-covariance Gaussian and the posterior measured 
from |(5fc|. The logarithmic plot in Fig. 2 shows that the tails of all 
approximations disagree with the true posterior distribution. How¬ 
ever, Vd is closest to the truth. 


2.5 Application to a Real Survey 


We have seen that a Gaussian distribution for Pnik) is not a good 
approximation to the true Rayleigh distribution if the number of 
modes is small. In this section, we study whether this has an impact 
on a real survey. We will base our analysis on an analytic linear 
error for the power spectrum and errors, but use survey parameters 
for the data release 11 (DRl 1) of BOSS. For a real survey, we have 
to take into account that the discrete positions of the galaxies in 
a given survey are sampled from a continuous random field by a 
Poisson point process (Feldman et al. 1994). To take this sampling 
process into account,Eq. (4) becomes 

fo(fci - k2) [F’(fci) -I- , (21) 

and hence also 


V 


oc 


Pnik) -h n-i 


( 22 ) 


The average number density n = 2 x 10“^ calculated 

from the number of galaxies contained in the BOSS DRl 1 CMASS 
sample (690,826) and its survey volume Vs = 10 Gpc® (Anderson 
et al. 2013) assuming h — 0.7. For the covariance matrices of the 
Gaussians, we need to know the number of modes (Feldman et al. 
1994; Tegmark 1997) 


M = V„VeB{k), (23) 

where 

_ fcn Afc; 

" “ 27r2 



Figure 1. Comparison of different posterior distribution functions for 1,10, 
100 and 1000 independent modes (from top to bottom). The blue line rep¬ 
resents the product of single Rayleigh distributed modes (true posterior dis¬ 
tribution) and some of the approximations, such as the Gaussian posterior 
distribution with a model-dependent covariance (green), and the Gaussian 
posterior where the covariance is estimated for a fixed fiducial model (red). 
The posterior takes the form of the dotted red line if the fiducial and the true 
power spectra agree, the dashed-dotted line shows the effect of choosing a 
fiducial model of which the power spectrum is wrong by 5 per cent. 


is the fc-space-“volume” of the nth fc-bin centred at with width 
Akn, and 


nP{k) 

1 + nP{k) 


(24) 


VMk) = V(s 


2 


(25) 
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Figure 2. Same as the bottom panel of Fig. 1, but with a logarithmic ordi¬ 
nate. 


Table 2. Kullback-Leibler divergences of the different approximations with 
respect to the true Vn at different scales kn for BOSS DRl 1 CMASS. 


kn 

-DkL [Pd I IPr) 

-^^KL {pf 1 I'Pr) 

r>KL 

0.004 

0.0213221 

0.382926 

0.335588 

0.012 

0.00451188 

0.0341685 

0.0374833 

0.02 

0.00336263 

0.0138535 

0.0199032 


is the effective volume. Anderson et al. (2013) calculate the power 
spectrum in Fourier modes averaged over bin widths of Ak = 
O.OOS/i Mpc“^. The values of the fc-bin centres and their corre¬ 
sponding number of modes M are M=18, 180 and 500 in the three 
lowest fc-bins centred at fc = 0.004, 0.012 and 0.02 Mpc h~^. We 
model tbe measured power spectrum as P{k) = tPPwfk), where 
b = 1.87 is the large-scale bias and Piin(fc) is a linear power spec¬ 
trum produced by CAMB (Lewis et al. 2000). For the other mea¬ 
surement we take 1| = P(k) -|- n~^. The resulting posterior 
distributions for the three lowest fc-bins are plotted in Fig. 3. At the 
largest scales, i.e. k — 0.004/i Mpc“^, neither Vo or Vf match 
Pr. At fc = 0.012/1 Mpc“^ and k = 0.02h Mpc“^ Pf and Pd 
become more similar, but neither of them features the asymmetric 
shape of Pr. Additionally, Pj and Pd produce smaller error bars 
compared to Pr. We can also numerically compare the distribu¬ 
tions if we introduce the Kullback-Leibler (KL) divergence (Kull- 
back & Leibler 1951). A distribution Pi is “better” than P 2 , if the 
loss of information due to approximating the true distribution with 
Pi is less than the same loss caused by using P 2 as an approxima¬ 
tion. If we use a probability density function (pdf) g to approximate 
another pdf /, a measure of the loss of information is given by the 
KL divergence 

DKi^{g\\f) = j (26) 

The KL divergences given in Tab. 2 tell us the same story as Fig. 3. 
The KL divergences of the Gaussian approximation with a vary¬ 
ing covariance Pd is at all scales less than the KL divergence of 
Pf, i.e. Pd is a better approximation to the true Pr. On the down¬ 
side, its best fit has an offset with respect to Pr. We will therefore 
investigate alternative posterior shapes in the next section. 


ber of independent modes is low, which is the case at large scales, 
i.e. small values of k. A similar problem arises when cosmological 
models are fitted to cosmic microwave background (CMB) power 
spectra, which are Wishart distributed. Bond, Jaffe & Knox (2000), 
Smith, Challinor & Rocha (2006), Percival & Brown (2006) and 
Hamimeche & Lewis (2008) have studied alternative distribution 
shapes that approximate the Wishart distribution. We take a simi¬ 
lar approach to Verde et al. (2003) and Percival & Brown (2006) 
and expand the natural logarithm of Eq. (9) around the maximum 

PH(k) = 

-2 In {Pr) = 2m(^^-^-+‘^+0 +const. (27) 

This equation agrees to third order with the Taylor expansions of 
the logarithms of the following distributions: 


• the inverse cubic normal (ICN) distribution (Smith et al. 2006) 

2 


-21n(PicN) = ISCfc ^ \P{k) - P{k)^/^P„{k) 
• the offset log-normal (OLN) distribution 


-1/3 


(28) 


-21n(PoLN) = 2(l + a)C'fe- 


P{k) In 


( PH{k) + aP{k) \ 

V P{k)+aP{k) J 


(29) 


if a = -1/4, 

• and combinations of any of the distributions given in chapter 
5.1 of Percival & Brown (2006). 


We can see from Fig. 4 that the 3rd order diverges for large 
values of the model power spectrum Pr. Hence the optimal free 
parameter a might differ from a — —1/4. Therefore, we use the 
KL divergence to optimise a in the offset log-normal distribution 
'PoLN- It can be found to be a = —0.201 at fc = 0.004^^, a = 
—0.240 at fc = 0.012^^ and a — —0.242 at higher values of 
fc- PoLN peaks at the maximum of the true distribution Pr and it 
approximates the tails of the true distribution a bit better than the 
Gaussian approximations, but as Fig. 3 shows, it is still obviously 
different from Pr. 

The ICN distribution (Smith et al. 2006) fits the true distribu¬ 
tion better. Fig. 4 shows a remarkable agreement between Pr and 
PiCN- Writing both —2 ln(P_R) and —2 In(PicN) as Taylor series, 
we see that their Taylor coefficients are equal for fc ^ 3 and ap¬ 
proximately equal for much higher orders (cf. Appendix A). 


4 THE EFFECT ON /nl MEASUREMENTS 
4.1 Physical Model 

In this section, we test the effect of using different posterior distri¬ 
bution shapes on the inference of a real observable. The largest 
deviations between the posteriors are at small fc and we would 
therefore expect the largest effects for parameters dependent on 
these modes. At these scales, (local) primordial non-Gaussianity 
alters the biasing law between dark-matter halos and the underly¬ 
ing mass-density field (Dalai et al. 2008; Matarrese & Verde 2008; 


3 STUDYING ALTERNATIVE POSTERIOR SHAPES 

We have seen in the previous sections that the true posterior distri¬ 
bution Pr is not well approximated by either P/ or Pd if the num- 


® For realistic, noisy measurements of |(5fe| and P{k), PR{k) has to be 
replaced by Pff{k) + n~^ everywhere in this section. For simplicity, we 
do not write the noise explicitly. 
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PhC* = 0.02ft/Mpc) ‘ 

Figure 3. Posterior distribution functions of the hypothetical power spec¬ 
trum Pff(fc) for the three lowest fc-bins of BOSS DRll CMASS. The 
colour coding is the same as in Fig. 1, with the addition of the offset log¬ 
normal (OLN) posterior distribution plotted in magenta. 



Figure 5. Galaxy power spectra Pg calculated for different values of /nl 
divided by the galaxy power spectrum Pg{fNL = 0) of a universe with a 
Gaussian primordial density field. 

Slosar et al. 2008; Afshordi & Tolley 2008; Valageas 2010; Gian- 
nantonio & Porciani 2010; Schmidt & Kamionkowski 2010; Des- 
jacques et al. 2011), making /nl a perfect test parameter of our 
analysis. The parameter arises in models where the potential has a 
local quadratic term 

<& = <^ + /nl - (</")) . (30) 

The resulting alteration of the bias can be written as 

/nl) = bo + 56(/nl) + Ab{k, /nl), (31) 

where bo is the bias in a Universe without primordial non- 
Gaussianity, (5fo(/NL) is the scale-independent modification to 
the bias from the non-Gaussian form of the mass functions and 
(Schmidt & Kamionkowski 2010; Desjacques et al. 2011) 

Ab{k, /nl) ~ (6o — l)fNhA{k) (32) 

is the local scale-dependent correction due to the easier halo forma¬ 
tion with additional long-wavelength fluctuations, which depends 
on the critical density 5c{z) in the peak- background split model, 
as well as the matter transfer function T{k), the matter density Urn, 
the present-time Hubble parameter Ho and the linear growth func¬ 
tion D{z) through the parameter 



Figure 4. Third and fourth order Taylor expansion to the true posterior 
shape Vr with M = 20 modes. The x-axis is a perturbation e = 
Ph /P — 1 of the model power spectrum Ph around the average recov¬ 
ered best-fit value P. As the third order approximation is not normalisable, 
the normalisation has been chosen such that it agrees with the 4* order at 
the maximum. The true posterior shape agrees very well with the inverse 
cubic normal posterior shape. 


A{k, z) 


SQmSdz) /HoY 

k^T{k) \ c ) 


(33) 


As 56(/nl) ^ Ab{k, /nl) at our scales of interest (Slosar et al. 
2008; Giannantonio & Porciani 2010), we neglect (5fo(/NL). 

Fig. 5 shows the effect of /nl on the galaxy power spectrum 
at large scales. We plot the galaxy power spectrum Pg divided by 
the galaxy power spectrum at /nl = 0, hence what we plot is 
proportional to the square of Eq. (31). At lowest k, negative /nl 
enhances the power spectrum due to the fact that the term propor¬ 
tional to /nl dominates the total bias. At slightly higher k, but still 
at large scales, the term linear in /nl dominates, and the power is 
enhanced or decreased depending on the sign of /nl. At small, yet 
still linear, scales, A{k, z) becomes small, thus initial local non- 
Gaussianities do not have an effect on the galaxy power spectrum 
at these scales. 

Here we work to first order in 5, so that we can continue to as¬ 
sume that Sk is drawn from a Gaussian distribution, with an altered 
P{k), i.e. the first order effect of non-Gaussianity is to Pir(fe), 
keeping the distribution the same. Furthermore, we do not alter Vn 
(Eq. (24)) to include any coupling between modes from the non- 
Gaussian signal. Where k is very small, higher order corrections 
to S will become increasingly important (e.g. Tellarini et al. 2015), 
suggesting that the Gaussian limit for S will break down here. 
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Table 3. /nl- postdictions of the best fit ’ and marginalised best fits 
(/nl). 2s well as its 95% confidence interval, for BOSS DR9 using differ¬ 
ent shapes of the posterior distribution. 


posterior 

r(BF) 

•/nl 

(/nl) 

95% confidence interval 

Vr 

0 

11.4 

-71.5< /nl <100.7 

Vd 

-25.5 

-11.9 

-68.2< /nl <53.4 

Vf 

0 

-7.7 

-90.9< /nl <71.0 

VlCN 

0 

11.4 

-71.9< /nl <101.2 


4.2 BOSS Results 

We use BOSS DR9 parameters and the same CAMB linear matter 
power spectrum as Ross et al. (2013). We also assume Sc = 
as expected from the spherical collapse model in an Einstein-de 
Sitter universe and a fiat prior for /nl • We plot /nl posterior func¬ 
tions in Fig. 6, assuming a measurement of a power spectrum with 
underlying /nl = 0. Pf is not symmetric, as both a linear and a 
quadratic term of /nl enter the power spectrum. The inverse cubic 
normal distribution agrees again very well with Vr. Vr, Vf and 
PiCN reproduce the true value as their best fit estimate. Using Vo, 
the most likely value of /nl is /nl = —25.5 considering the same 
fc-bins as Ross et al. (2013) in their analysis of DR9 BOSS data, 
i.e. 0.004^ ^ fc < 0.05^. 

One has to keep in mind that there are different definitions of 
the measured value. The commonly published value is the poste¬ 
rior mean (/nl), due to the fact that if /nl is fitted as part of a 
longer list of cosmological parameters, one has to rely on Markov 
chain Monte Carlo techniques (e.g. Lewis & Bridle 2002). In gen¬ 
eral, such techniques cannot provide accurate estimates of the best- 
fit value. Hence, data analysis papers more often present (/nl) as 
their results. If the posterior is asymmetric, the best fit and posterior 
mean do not agree. Given a flat /NL-prior, we expect /nl = 11.4 
using Vr. Based on our arguments in Sec. 2.1 and 4.1, we think of 
the mean of Vr as the correct estimate of /nl • This seems counter¬ 
intuitive because our input was that we measure a power spectrum 
which corresponds to /nl = 0, but we have to consider that P{k) 
is a finite empirical realisation in our part of the universe corre¬ 
sponding to the value of /nl = 0 we have assumed we would 
measure locally, but due to the non-Gaussian shape of the posterior 
distribution, the ensemble average of /nl measured in other parts 
of the universe is higher than the value we set as an input for our 
local environment. 

Our results are summarised in Tab. 3. Picn reproduces the 
correct estimate of /nl, whereas Vd and Vf estimate /nl = 
— 11.9 and /nl = —7.7 respectively. The choice of the posterior 
distribution also affects the error estimation. If we use Vr or Picn, 
the length of our postdicted 95% /nl -confidence interval (C.I., cf. 
Tab. 3) is similar to the length of Ross et al. (2013)’s most naive 
case ii 95% C.I., i.e. 32 < /nl < 198. 


4.3 Euclid Results 

We make similar predictions for the Euclid survey (Laureijs et al. 
2011). We assume bias values b{z) = \/\ + z, matched to sim¬ 
ulations of Orsi et al. (2010) and also assumed in Amendola et al. 
(2013), and number densities n{z) predicted for Euclid by Pozzetti, 
Hirata & Geach (2015), and a survey covering 15000 square de¬ 
grees. We generate CAMB matter power spectra P{k, z) for the 
redshift range 0.9 < z < 1.74. Note that the aim of this article is 



Figure 6. Analytic /nl -posterior functions for a BOSS like survey com¬ 
bining all fc-bins. 



Figure 7. Analytic /nl -posterior functions for a Euclid like survey com¬ 
bining all fc-bins. 


to test how the use of different posterior shapes influences cosmo¬ 
logical measurements, but not primarily to make /NL-predictions. 
We refer to more rigorous predictions which can be found e.g. 
in Fedeli et al. (2011); Laureijs et al. (2011); Giannantonio et al. 
(2012); Yamauchi, Takahashi & Oguri (2014). These studies also 
include 3-point statistics, weak lensing tomography, measurements 
of the integrated Sachs-Wolfe effect and/or the use of the multi¬ 
tracer technique. Their constraints are therefore tighter than ours. 

As Euclid will probe a much larger volume, it will accommo¬ 
date many more fc-modes and hence we see good agreement ofVf 
with Vr in Fig. 7. As against our results in Sec. 2.5, fixing the co- 
variance provides better /nl results than the inferences from a pos¬ 
terior with varying covariance. However, Picn is still the best ap¬ 
proximation and accurately reproduces the marginalised /NL-value 
of Vr and its 95% C.L, whereas using Vf yields the correct width 
of the 95% C.L, but its position and the marginalised value have 
an offset of 0.38 (cf. Tab. 4). We therefore still recommend either 
using PicN or Vr when cosmological models are fitted to power 
spectra from galaxy surveys even as large as Euclid. 


Table 4. /nl -predictions similar to Tab. 3, but for Euclid. 


posterior 

.(BF) 

•/nl 

(/nl) 

95% confidence interval 

Vr 

0 

0.24 

-9.0< /nl <9.4 

Vd 

-1.0 

-0.30 

-8.4< /nl <7.8 

T’f 

0 

-0.14 

-9.4< /nl <9.0 

"PlCN 

0 

0.24 

-9.0< /nl <9.4 
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5 CONCLUSIONS 

We have studied different posterior shapes that can be used in 
the fitting process of cosmological models to power spectra from 
galaxy surveys. As the underlying matter density field is at least 
approximately Gaussian, we assume that the true posterior distri¬ 
bution Vr is based on a Rayleigh likelihood distribution in 5. As¬ 
suming Gaussian posteriors in P{k), be it with a fixed or a varying 
covariance matrix, does not approximate Vr well and yields biased 
best-fit values and wrong error estimates especially on large scales 
where statistics are not good enough to make use of the central limit 
theorem. 

If one confines oneself to use Gaussian posterior shapes, it 
depends on the parameter one wants to constrain whether a fixed 
or varying covariance matrix provides more accurate results. We 
found that the posterior shape Vd with varying covariance follows 
Vr closer than Vf with a fixed covariance when the power spec¬ 
trum Ph (or any parameter linear in the power spectrum) is fitted 
to the power spectrum P, but when /nl is fitted to P it is the other 
way round. 

Due to these reasons, we advise against using Gaussian pos¬ 
terior distributions. Instead, we have found that posterior distri¬ 
butions, such as the inverse cubic normal distribution "Picn (cf. 
Eq. (28)) or applying Hamimeche & Lewis (2008)’s method to Vr 
(cf. Eq. (14)), provide simple, more accurate alternatives. They con¬ 
fidently reproduce the correct width of the 95 % confidence inter¬ 
vals in our simplified predictions of /NL-measurements. However, 
the final decision about which posterior is the best to use should be 
done after testing these methods against simulations which account 
for the non-linear effects that we have ignored for simplicity in our 
analytic calculations. We leave this for future work. 

A major advantage of the non-Gaussian posteriors presented 
in this paper, is the fact that their covariance matrices do not depend 
on the power spectrum of the model to be tested. The estimation 
of covariance matrices is a critical and computationally expensive 
step in the data analysis. Extensions to configuration-space analy¬ 
ses based on the correlation function ^(r) are left for future work. 


ACKNOWLEDGMENTS 

The authors would like to thank Matteo Tellarini, Ashley Ross 
and David Wands for valuable discussions about primordial non- 
Gaussianity. We thank the referee Andrew laffe for his helpful 
comments. 

Some of the results in this paper have been generated using 
the CAMB package (Lewis et al. 2000). Eor some of the other re¬ 
sults, we made use of the facilities and staff of the UK Sciama High 
Performance Computing cluster supported by the ICG, SEPNet and 
the University of Portsmouth. 

WJP acknowledges support from UK STEC through the con¬ 
solidated grant ST/K0090X/1, and from the European Research 
Council through the Darksurvey grant. LS is grateful for sup¬ 
port from SNSE grant SCOPES IZ73Z0-152581, GNSE grant 
ER/339/6-350/14, and DOE grant DEEG 03-99EP41093. 


REEERENCES 

P. A. R. Ade et al. [Planck Collaboration], Astron. Astrophys. 571 
(2014) A24 [arXiv: 1303.5084 [astro-ph.CO]]. 

N. Afshordi and A. J. Tolley, Phys. Rev. D 78 (2008) 123507 
[arXiv:0806.1046 [astro-ph]]. 


L. Amendola et al. [Euclid Theory Working Group Collabora¬ 
tion], Living Rev. Rel. 16 (2013) 6 [arXiv: 1206.1225 [astro- 
ph.CO]]. 

L. Anderson etal. [BOSS Collaboration], arXiv: 1312.4877 [astro- 
ph.CO]. 

J. M. Bardeen, J. R. Bond, N. Kaiser and A. S. Szalay, Astrophys. 
J. 304 (1986) 15. 

L. Blot, P. S. Corasaniti, J. M. Alimi, V. Reverdy and Y. Rasera, 
Mon. Not. Roy. Astron. Soc. 446 (2015) 1756 [arXiv: 1406.2713 
[astro-ph.CO]]. 

I. R. Bond, A. H. Jaffe and L. E. Knox, Astrophys. J. 533 (2000) 
19 [astro-ph/9808264]. 

C. H. Chuang, F. S. Kitaura, F. Prada, C. Zhao and G. Yepes, 
Mon. Not. Roy. Astron. Soc. 446 (2015) 2621 [arXiv: 1409.1124 
[astro-ph.CO]]. 

S. Cole and N. Kaiser, Mon. Not. Roy. Astron. Soc. 237 (1989) 
1127. 

N. Dalai, O. Dore, D. Huterer and A. Shirokov, Phys. Rev. D 77 
(2008) 123514 [arXiv:0710.4560 [astro-ph]]. 

M. Davis, G. Efstathiou, C. S. Frenk, S. D. M. White, Astron. J. 
292(1985) 371 

K. S. Dawson et al. [BOSS Collaboration], Astron. J. 145 (2013) 
10 [arXiv: 1208.0022 [astro-ph.CO]]. 

V. Desjacques, D. Jeong and F. Schmidt, Phys. Rev. D 84 (2011) 
061301 [arXiv: 1105.3476 [astro-ph.CO]]. 

C. Fedeli, C. Carbone, L. Moscardini and A. Cimatti, Mon. Not. 
Roy. Astron. Soc. 414 (2011) 1545 [arXiv: 1012.2305 [astro- 
ph.CO]]. 

H. A. Feldman, N. Kaiser and J. A. Peacock, Astrophys. J. 426 
(1994) 23 [astro-ph/9304022]. 

T. Giannantonio and C. Porciani, Phys. Rev. D 81 (2010) 063530 
[arXiv:0911.0017 [astro-ph.CO]]. 

T. Giannantonio, C. Porciani, J. Carron, A. Amara and 
A. Pillepich, Mon. Not. Roy. Astron. Soc. 422 (2012) 2854 
[arXiv: 1109.0958 [astro-ph.CO]]. 

S. Hamimeche and A. Lewis, Phys. Rev. D 77 (2008) 103013 
[arXiv:0801.0554 [astro-ph]]. 

J. Hartlap, P. Simon and P. Schneider, Astron. Astrophys. [Astron. 
Astrophys. 464 (2007) 399] [astro-ph/0608064]. 

A. F. Heavens and A. N. Taylor, Mon. Not. Roy. Astron. Soc. 275 
(1995) 483 [astro-ph/9409027]. 

S. Kullback and R. A. Leibler, Ann. Math. Statist. 22 (1951), no. 
1, 79-86. 

R. Laureijs et al. [EUCLID Collaboration], arXiv: 1110.3193 
[astro-ph.CO]. 

A. Lewis and S. Bridle, Phys. Rev. D 66 (2002) 103511 [astro- 
ph/0205436]. 

A. Lewis, A. Challinor and A. Lasenby, Astrophys. J. 538 (2000) 
473 [astro-ph/9911177]. 

M. Manera, R. Scoccimarro, W. J. Percival, L. Samushia, 
C. K. McBride, A. Ross, R. Sheth and M. White et al., Mon. Not. 
Roy. Astron. Soc. 428 (2012) 2, 1036 [arXiv: 1203.6609 [astro- 
ph.CO]]. 

S. Matarrese and L. Verde, Astrophys. J. 677 (2008) L77 
[arXiv:0801.4826 [astro-ph]]. 

P. Monaco, T. Theuns and G. Taffoni, Mon. Not. Roy. Astron. Soc. 
331 (2002) 587 [astro-ph/0109323]. 

P. Norberg, C. M. Baugh, E. Gaztanaga and D. J. Croton, Mon. 
Not. Roy. Astron. Soc. 396 (2009) 19 [arXiv:0810.1885 [astro- 
ph]]. 

A. Orsi, C. M. Baugh, C. G. Lacey, A. Cimatti, Y. Wang and 
G. Zamorani, Mon. Not. Roy. Astron. Soc. 405 (2010) 1006 



Inference from GC: The effect of the P{k) posterior 9 


[arXiv:0911.0669 [astro-ph.CO]]. 

W. J. Percival, Mon. Not. Roy. Astron. Soc. 356 (2005) 1168 
[astro-ph/0410631]. 

W. J. Percival and M. L. Brown, Mon. Not. Roy. Astron. Soc. 372 
(2006) 1104 [astro-ph/0604547]. 

W. J. Percival et at [2dFGRS Collaboration], Mon. Not. Roy. As¬ 
tron. Soc. 353 (2004) 1201 [astro-ph/0406513]. 

W. J. Percival, A. J. Ross, A. G. Sanchez, L. Samushia, A. Burden, 

R. Crittenden, A. J. Cuesta and M. V. Magana et al ., Mon. Not. 
Roy. Astron. Soc. 439 (2014) 2531 [arXiv:1312.4841 [astro- 
ph.CO]]. 

L. Pozzetti, C. Hirata and J. Geach, in preparation. 

A. J. Ross, W. J. Percival, A. Carnero, G. b. Zhao, M. Manera, 
A. Raccanelli, E. Aubourg and D. Bizyaev et al ., Mon. Not. Roy. 
Astron. Soc. 428 (2013) 1116 [arXiv:1208.1491 [astro-ph.CO]]. 
D. Schlegel et al. [BigBoss Collaboration], arXiv: 1106.1706 
[astro-ph.IM]. 

F. Schmidt and M. Kamionkowski, Phys. Rev. D 82 (2010) 
103002 [arXiv: 1008.0638 [astro-ph.CO]]. 

R. Scoccimarro and R. K. Sheth, Mon. Not. Roy. Astron. Soc. 329 
(2002) 629 [astro-ph/0106120]. 

A. Slosar, C. Hirata, U. Seljak, S. Ho and N. Padmanabhan, JCAP 
0808 (2008) 031 [arXiv:0805.3580 [astro-ph]]. 

S. Smith, A. Challinor and G. Rocha, Phys. Rev. D 73 (2006) 
023517 [astro-ph/0511703]. 

S. Tassev, M. Zaldarriaga and D. Eisenstein, JCAP 1306 (2013) 
036 [arXiv: 1301.0322 [astro-ph.CO]]. 

A. Taylor, B. Joachimi and T. Kitching, Mon. Not. Roy. Astron. 
Soc. 432 (2013) 1928 [arXiv: 1212.4359 [astro-ph.CO]]. 

M. Tegmark, Phys. Rev. Lett. 79 (1997) 3806 [astro-ph/9706198]. 
M. Tellarini, A. J. Ross, G. Tasinato and D. Wands, 

arXiv: 1504.00324 [astro-ph.CO]. 

P. Valageas, Astron. Astrophys. 514 (2010) A46 

[arXiv:0906.1042 [astro-ph.CO]]. 

L. Verde et al. [WMAP Collaboration], Astrophys. J. Suppl. 148 
(2003) 195 [astro-ph/0302218]. 

L. Verde, R. Jimenez, L. Alvarez-Gaume, A. F. Heavens and 

S. Matarrese, JCAP 1306 (2013) 023 [arXiv: 1301.6017 [astro- 
ph.CO]]. 

M. White, Y. S. Song and W. J. Percival, Mon. Not. Roy. Astron. 
Soc. 397 (2008) 1348 [arXiv:0810.1518 [astro-ph]]. 

D. Yamauchi, K. Takahashi and M. Oguri, Phys. Rev. D 90 (2014) 
8, 083520 [arXiv: 1407.5453 [astro-ph.CO]]. 


The ICN distribution in terms of e is given by 

-2 In(PicN) = 9 [l - (1 -P " 

= 9[l-2(l-Pe)-i/®-p(l-P£)-^/®] . (A3) 

We make use of the generalised binomial series (1 + e)“ = 
0 (“)£"’ where (“) = *e generalised bi¬ 

nomial coefficient, and obtain the series 


—2 In(T’icN) = 9 

K=\ 



(A4) 


Again, we have ignored irrelevant constant terms. The negative 
entries in the binomial coefficients can be removed using (“) = 


—2 In(T’icN) 


9^(-1)-£- 

K.= l 




(A5) 


If we insert values for k 3, we find the equality 



Thus, Vr and "Picn are the same to third order. What is 
striking is that for larger K,the approximation 


(A6) 

even more 



(A7) 


still holds. For k < 17, the two sides differ by less than 20%. 
Therefore, the agreement between Vr and Picn is high. 


This paper has been typeset from a Tj^ LTjX file prepared by the 
author. 


APPENDIX A: COMPARISON OF Vr AND Picn TAYLOR 
SERIES 

In this appendix, we compare the Taylor Series of Vr and Picn to 

explain why they are so similar. We write the hypothetical power 
-- ~2 

spectrum Ph = (l-|-e)|(5fe| as a perturbation around the measured 
power. The Rayleigh posterior hence becomes 

-21n(Pfl) =21n(l-Pe)-P:j-^. (Al) 

Ignoring the irrelevant zero order contribution, the Taylor series 
reads 

OO 

-21n(P«) = 2^(-l)V 

K=1 


(A2) 



